Incorporating pass-phrase dependent background models for text-dependent speaker verification
نویسندگان
چکیده
In this paper, we propose a pass-phrase dependent background model (PBM) for text dependent (TD) speaker verification (SV) to integrate pass-phrase identification process (without an additional separate identification system) in the conventional TD-SV system, where a PBM is derived from a text-independent background model through adaptation using the utterances of a particular pass-phrase. During training, pass-phrase specific target speaker models are derived from the particular PBM using the training data for the respective target model. While testing, the best PBM is first selected for the test utterance in the maximum likelihood (ML) sense and following the selected PBM is used for the log likelihood ratio (LLR) calculation with respect to the claimant model. The proposed method incorporates the pass-phrase identification step in the LLR calculation, which is not considered in conventional standalone TD-SV based systems. The performance of the proposed method is compared to conventional text-independent background model based TD-SV systems using a Gaussian mixture model (GMM)-universal background model (UBM), Hidden Markov model (HMM)-UBM and i-vector paradigms. In addition, we consider two approaches to build PBMs: one is speaker independent and the other is speaker dependent. We show that the proposed method significantly reduces the error rate of text dependent speaker verification for the non-target types: target-wrong and imposter-wrong while it maintains comparable TD-SV performance when imposters speak a correct utterance with respect to the conventional system. Experiments are conducted on the RedDots challenge and the RSR2015 databases which consist of short utterances.
منابع مشابه
Parallel Speaker and Content Modelling for Text-Dependent Speaker Verification
Text-dependent short duration speaker verification involves two challenges. The primary challenge of interest is the verification of the speaker’s identity, and often a secondary challenge of interest is the verification of the lexical content of the pass-phrase. In this paper, we propose the use of two systems to handle these two tasks in parallel with one subsystem modelling speaker identity ...
متن کاملOn the need of template protection for voice authentication
In this work we study the need of template protection to provide security and privacy in text-dependent pass-phrase voice authentication systems. For this purpose, we analyze the robustness of two state-of-the-art speaker verification systems against attacks performed using input data generated from a compromised voice template. This analysis shows that compromised templates can be used to gain...
متن کاملGeneral phrase speaker verification using sub-word background models and likelihood-ratio scoring
We present a design and study the performance of a text-dependent speaker veri cation system using general phrase passwords. The text of the password utterance and its phone transcription are assumed to be available. The problems that are addressed include the appropriate choice of units for building target speaker models and the choice of background models for likelihoodratio scoring.
متن کاملText-Dependent Speaker Verification System in VHF Communication Channel
Text-independent speaker verification can reach high accuracy provided that there are sufficient amount of training and test speech utterances. Gaussian mixture model universal background model (GMM-UBM), joint factor analysis (JFA) and identity-vector (i-vector) represent the dominant techniques used in this area in view of their superior performance. However, their accuracies drop significant...
متن کاملOn the Influence of Text Content on Pass-Phrase Strength for Short-Duration Text-Dependent Automatic Speaker Authentication
In the context of automatic speaker verification it is well known that different speech units offer different levels of speaker discrimination. For short-duration, text-dependent automatic speaker recognition, a user’s pass-phrase bears influence on how reliably they can be recognized; just as is the case with text passwords, some spoken pass-phrases are more secure than others. This paper inve...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computer Speech & Language
دوره 47 شماره
صفحات -
تاریخ انتشار 2018